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Genomic sequence of the mouse villin gene reguiat ry s 



quences 





GATCTGGTGC 


ACCAAGGACA 


CTGTGGTCCC 


AGCACTGGGG 


AGGTGGAGGG 


AGGAGGGTCA 


60 




GAAGTTTAAG 


GTCATCCTTG 


GTTACATAGC 


AAGGTTTCAG 


CCAGCTTCAG 


CTACATGAAA 


120 




CCTTTGTTTG 


TTTGTTTGTT 


TGTTTTAAAG 


CATTAATAAA 


TAATACCATA 


AGGAGGTTGG 


180 




CAGTGGTGGC 


AGACACCTTT 


AATTCCAGTA 


TTCAGGAGGC 


AGAAGCAGGC 


AGATCTCTGT 


240 




GAGTTCGAAG 


TCAGCCTAGT 


CTGCAAAGCT 


AGTTCCAGGA 


TGGCAAGGGC 


TACACAGAGA 


300 




AACCTTGTCT 


CATAAAACCA 


AAGTAGTAGT 


AGTAGTAGTA 


ATGCCATAGA 


GAAAATTGGA 


360 




GTCCATTCAG 


GATGGACCAT 


CCTATAAGAT 


GATTCTCTTG 


ACCCAGGTAA 


GCTAATGTCA 


420 




TGGGGAAAGG 


GGATGGGACT 


GTCCTAGATT 


AAAAAGTGCT 


GAGGCGATGC 


CTATTCTCAA 


480 




TTTGATTCCA 






CCAAGAGAAG 


TGGAACTGGG 


ACT CTGGACT 


540 


p 


GAAGACGTGA 


CGGC CTTATA 


AACACTGGCA 


CTTATAAACA 


CTTATAAACA 


CTGGCACAGG 


600 




CGTTCAGGTT 


XGAAGATCAC 


TTTCAAACCA 


GAGAACAGAA 


AGTGCTCGCT 


CGTCCTCAGC 


660 


(39 


GTAGCGAGCA 


CTGGCTGCAG 


AAGAGTGATA 


TTTAGTGAAA 


GCTACCTTCA 


CAATATCTTT 


720 




GCACTTATCA 


CATACACGTG 


TCAAATGTGC 


TAACTCCCTA 


GTCCACAGAT 


GGCTGTTACA 


780 


kU 

y 


CTCGTTTCTG 


CTTTCCCATC 


TGGTTGACAT 


TTGTCAGAAC 


CAGAAATTAG 


AAATGTGGGT 


840 


f a i 


ATTTATTTGT 


GTGCTGAGGA 


CACCATCCAG 


GGCTTTTCAC 


ATTTCAGGCA 


CATGGTTTAC 


900 




TAACTGGGCT 


ACTTCTCCAA 


CGGTTTGAAA 


CCATTTGTTT 


TATATTTACT 


TATTTTGTGT 


960 



ill GCATGAGGTA GGCATGTATA CGTATGTATA GGAGTCATGC ATGTGGCTGC TACCCTCAAA 1020 



CTAGGCCTAA GGTAATCTTT AGTTTTTTAA AAATATATAT TCTCAGCCGG GTGTGGTGGC 1200 
ACACGCCTTT AATCCCAGCA CTTGAGAGGC TGAGGTGTAG GAATTATACA CACAGGCCAG 1260 



CCTGCGTGTA 


TGTCCGTGCA 


AGGGTGTCAG 


ATCCCTTGGA 


GCTGGAGTTA 


AAGACAGTTG 


1380 


TGATCACGCT 


GCCGTTACAG 


ATGCTGGAAA 


TTGAACCCAG 


GTGTCCCTAG 


AGAAGCAGGC 


1440 


AGTGCTCTTA 


ACTTCTGAGC 


CACCCCTCCA 


ACCCTGCTTT 


TAGAGACTCT 


TAACCTTTTG 


1500 


TGTAATGTGG 


GAACTGAGTG 


GATCTTGCAC 


TTACCAAGTG 


TGTGCTGCGC 


TGTAGCATCA 


1560 


CTGAGCCCGT 


ACCCACACGA 


CTAGTGGATA 


CAGTTTAAGG 


GCAAACACTT 


AACAATGACA 


1620 


ATAGTTGGAT 


AGAGTTTGAA 


TATAGTCCTG 


AGCTATTGGT 


TAGCGTGACC 


TTTGCTGTCC 


1680 


TTAGCATGTG 


CTGTGAGAAG 


ATAGAAAAAT 


GAAGACTTGA 


GTCTAGTCCT 


GGAACCCACA 


1740 


GAGGCAGGCG 


AGAACCCACT 


CCTGAAAGTT 


GTTCTCTGAG 


CTTCACATAC 


AACTTCACAT 


1600 




CTGGGGTGCA GAGCTTGGCC CTl 



TGTTTTTTCT TTATGTGCAC TGGTGT CTTA 13 20 
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AATAGTTACA ATGATAATAA TAATTAGTAA ATTCTTTTAA AAGGTATATG TTGGGAGGGA I860 
GAGATGGCTC AGCTTCCAGG AGCACTTGCT GCTCTTGCAG AGGACCTAGA TTCAGTTCCC 1920 
AGGACTCATA TGGTGGCTCA CAGCCATCTG TAAATCCAGT TCCAGAGGGT TCCACACCCT 1980 
CTTCTGGCCT CCACAGGCAC CACATACATA GTACACAGAC ATACATGCAG GCAAAACACC 2040 
CATACACACA TAAATAAATA AGGAAACTTA AAAGGTG CAT GTGTTGGTAA ACATTGTGCT 2100 
TACACATGCT GATTGAAGAC ATGTACAACG CACACACTGA AGAGGGATCT GGGGCTGGAG 2160 
AGATGGCTCA GCGGTTAAGA GCACTGACTG CTCTTCCGAA GGAAGGTCCT GAGTTCAAAT 2220 
CCTAGCAACC ACATGGTGGC TGACAACCAT CCATAATGAG ATCTGACACC CTCTTCTGGT 2280 
GCATCTGAAG ACAGCTGCAG AGCTACAGTG TACTTAGATA TACTAATAAA TAAATCTTTT 2340 
TTTAAAAAAA TGAAGAGGGA TCTGAGACAC CTCAAAAGAG ATTATGAGCA GTGACTCACG 2400 
GGTGATTATC TATCCTGGAG TTTTTCCTTT CCGCTTGGCT TGCAACTGGG TGGACAGACG 2460 
CCCCTTTTCA TTCACAAGAA CGGGTGCTAC ATTATTTCTG AACAAAACAG CACCTGCAGT 2520 
ATGTTTACTG TCCTTGCTGA CTATGAGCAC GCGCACGCGC GCGCGCACAC ACACACACAC 2580 
ACACACACAC ACACACACAC ACACACACAC ATTCAGTCTC CAGAGCTCTT GGGAAGGTCA 2640 
AGAAGAGGCT GCCCTCAAAC ACGATCTTCA TCTTTCCCTC CTAAAGGAGA CCACGATTCC 2700 
AAGGTGGCAG AAGATCTACA GGGGGCAGAG QCAGGGAGQG GGAAGCAGGC CATGGTTTCC 2760 
AGAGACCTAC AGCAGAGGGC AGCAAGGCAG ATCCCCAGGT CCAGGGCAGG GAGGTGGAGG 2 820 
CCCTTGTTCC GAGGAGAAGG CAGGCGGCAG AACAGGGTTC AAAGGCACAG GTTTATGGCA 2880 
GCTCATAAAA GTGGAGGTCG TGGCTCACTC AGAAAGGAGG AAGAAGGGAA AGGCCCTTGT 2940 
GCCCACTGAG CGAGGGTCAT GCTGAGTAGG AGAGATCTGC AGGGGTGCCA GGAGCCCCAC 3000 
CTGTCTGTCC CAAGGGAACC CCAAGTGTGA ACTCTGGCCT TGGGTGCTGA GTTCCAGCTA 3060 
CAAGACCCCA GGAGTCCTAC TCCATCCCCA TCCAGTGCCC CCTCGCCCCG CCACACCCCA 3120 
CCCCCGACTC CCGTGCCACT TCTCTAGGGC TGGAGGGTGG CCAGCCCTGG TGGGGGTTGC 3180 
CTACCTGCAG GTAGAGCCCA GGTCCTAGCC GGAAGTGCAC CCCATCCCTG AAGCTGCAGA 3240 
GCCAAGGGCG GGGCACACGG CAGCTCAGGC TGTCAGGCTG TTGCTGGGCT CTAGGTTCCC 3300 
AGGGACCTGG GCACCTACTT CCCCACCCCC CCATCCATTC TCTCTGGGGC CCTATCTTCC 3360 
CTTATATGGT GAAGGAAGTT CCIGGGGGGG GGGGGTGGTG GTGAGGACAA AGGTCGTTCG 3420 
GTCTCCTGCA GCCAGCTTGC CACAACTTC C TAAGATCTCC CACZTTGGTGG CTGCCTCTTP 3480 

+1 exon 1 

(transcription start site) 

CASACAPGTA AGGCAATTGG GTGGGGACAC ATGGTGACCA CAGGTGGTTG GAGGGGACAG 3540 
GGTCCTTGCT TCTCTCTGGC AGCCTGTGCT TTCTGTAGCA CCTTGGTATA AGTTTGGGGG 3600 
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TGAGGTAAGG TGCTCTGAAA CTCTGAAAGA AGCAAGAAGC CAGCAGGCTG TCTTGGGCCT 3660 
TCAATGAAGG AAGTTCACAG ACCCCCTTTC CTGTAAGTCA CCTTCGCTTC ATCTGTGXAG 3720 
ATTCCCTGGG ACCAAGGTGG CTCCTGGGkC TCAGATTTCT ACAATTAAAA TCAGGACAGT 3780 
CCTGAGACTT GGACTCCGTG CCTGTATTTA CTACTTCTCT CTGGCTGCTC ATTTCTGTGT 3840 
TCATGTCTTA CACATCTGAA ATGGTTTCTT TGTGTCACCA TTCCCCTGAC ACTCCTGGGA 3900 
GGTCGTATCC TTGGCACATG TATCCTGGGA TGTAAGCTGC AGCCACCAGG AGAGAGGGGG 3960 
AGAGTCAGGA GCTGTGTCCT AGGCCCTATT AGGCCTGGAC ATCACCCCTT TCCTAGAAAT 4020 
GGCCCCTCCA TTTTTCGGTT ACCATGATCT ATTTTATATC AGAGTGGGCA GTGAAAGCCA 4080 
AACCTGCCCA GAAGTTTGGG ACTCACTCAG ACCAAGGTTA TCTGCTCAGA AATCCCCCTG 4140 
TCACTTGAGG TTGGGAGAAT CTGCCTCTGG GGGCTTCCAG GTCTTGGTTA GCAGGAGGGT 4200 
ATCCTTTGTA TAGGGCATGA CCTAGTCTAT G GTG TT A CTA CATTCCTGTC CAGTTAAAAG 4260 
CTGGAACTAA AACCCACGGC AGCGCCCAGG ATTCTCTACA GTTGTACCCC AAGAACAACA 4320 
AGACAGTAGA TATGCAAGGA TAGGTAGCTG GGGAGAAGAA GAACTTAAAC CCCCCCAAAG 4380 
GCCCACAGGT TCCGTTCCCT AGTTCACAAT GCCAGTATGA GTGCTAGCTA CTATGGGCTG 4440 
TGAGTTGGTA GCTACAAGCA TGAGTGATGT TCATGTGTGT AGTGTGTATA ATCTGAGCAC 4500 
TTGGGAGGCT GAAGCAGGAG GATTGCTATA TGTTTGAGGC CAGCCTGAGC TATAGAGCGA 4560 
GACTTTGTCT TTAAGAAAAA AATGAAAGCC CAGCAGTGGT GGCACACGCC TTTAATCCCA 4620 
GCACTTGGGA GGCAGAAGCA GGCAGATTTC TGAGTTCAAG GCCAGCCTGG TCTATAGAGT 4680 
GAGTTCCAGG ACAGCCAGGG CTACACAGAG AAACCCTGTT TTGAAAAACC AGAAAAACAA 4740 
AACAAAACAA AACAAAACAA AACCCAAACC CAAACCCAAA CCTCTCATCT CTCATCTCTC 4800 
TAGGCTGTGT CTGTCTAGGT GGTAGAGTTT GGGGACTTCA GACTTATATA TAAATAGGCC 4860 
TTTTTATCAC TGGTCAGAGA CGAGAAAGGT TTCAGTCTGG GACACAGTGG GACCCTGAGA 4920 
AAGTACTCCT TGCCAGCCCA AAAATTCTGG GAAGGCTTCC TGGAGGAAGT GTGTCCCGAT 4980 
CAGACTACTG TTCTAGAAGG CAGAAGAGAG GGTTGGAAGA ATGTTGGTGG ACAGACAGTT 5040 
GGAACAGAAG GACAGGAGGG GGAGGCATCC AAGATTCTGA ACATGTAGCT GACTTTTGGT 5100 
TCTCTGGGTG ACAAGTGTCC CCCAGGGATA GGGCTGTAGA AAGGGGACCA GGGGTGAGCC 5160 
AATGAGTTCA AGTTGAGGGA CACATCCAGC CCAGGGTCCT TGCTGGCAAG CTAAAGAATG 5220 
AGAGCCCTCT AACCCTCCCT GAAGTTTAGG GGAGACAGGA GAGCTGAGGA GATCCTTCTA 5280 
GGGTGAAGGA GAGGTATCTG CTCTGACCAA CATGGCTAGG AGCAGAAGCA GTTGGACCAG 5340 
TTACCCCTCA GAACCAGCCA TCCCCTCTTG GCTCTAAGGA GGCTGGGCCC CTTTCTGTTT 5400 
AAGAATCTTA CTTTTCTTCA GAGAGAGGCA GCAAGCCTTT GTCCCOTCCC TGTTGGTCAA 5460 
TAAACACCCC TGTGTGTAAC ATTAGTTTAT TTTACTGTCA GTTTGCTCCA GGACAGTCCA 5520 
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TCTGGTAGAC CTCTGCTCCT AACTCACCAA GGTATGGCCC ACATTCCTCA CCCAGAAGAG 5580 
TGCAGAAGAG AGCCTTAGAG AAAGGGTAAC AGTAACAAAG ATGGCCAGAA TAAAACAAAA 5640 
ACTACTATCC TTTGT AC C CA AATTGGTTTT GCTGAACCAG GAGGGGGTGT GTGAGTGTAT 5700 
GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 5760 
CTTGGGGGAC TTTTCATGCT AAAGAATATC TGATATTGGC GCCCATGCCA ACAGGGGTAT 5820 
TGGGGAGAGT CAGGCTTCTG CAAACACAGT AAGCTGCCGA AGATGGATTG GTGGCCTGAA 5880 
TCACCAAGGG GCAGGCTGAT CAGAGTGGAC AGAACATCAC AAGATAAGCC ACCCTGTGGG 5940 
GCTCAGAAGA GGGAGTTTAC AAGAGGTAAA GGCCAAGCCA TTTATTATCC AAGACATGAC 6000 
TCAAAATCAA AGTGCAAGGA GAGATTAGCT GGAGAGATGG GGCTGTCAGT GTGGGACACC 6060 
TGACCTTGCA CTTATTAGTC ACTAGGCCAA GGAGCAGTCA CAGAGGGTGA CTGGGTCCTA 6120 
CTCAGCTTGG AGCAGGCACG TGGAGAATGG GTGACCTCCA TCCTGATGGA GAGGGCTGAG 6180 
CACCACCAGG TACAAGTGTT CCCTGTGTCT GATGCCAGGA TTCCTGGCCA GTTTTCAAAG 6240 
GACTAAGGAC TCATCTCTGG TGGAAACAAA GTATCCAAGC CCTAAGCCCC ATTTTGGTCT 6300 
AATTAAATCA GAACCCCTGG GGATGCAGGC TCTGAGCAGC AGGAGCTTTT TAAAAAGCTC 6360 
CCAGGTGATT CTGATCAGCA GCTGGAACAA ACACAGCTAC AGGTTCAAAC AGAAAGAGGC 6420 
AAAGCTAGGG AAAGCTTGGG ATGGGGAGCC TTCTTCCAGG CCAGTAGATG GAGGCTGGTT €480 
AGCAGTGGTG GCAGCTTCTC TCTGCCTGTC ATATAGCTAT CCATCCACTC ATCCATCCAT 6540 
ACACCCACCC ATCCATTTAT GCACCCATCC TTCCATCCAT CCATCTATCC AGCTACCCAC 6600 
CGACGCATCC ATCCAAACCT TCCTTTTCTC CTTCTTTCTT TCTTTTTTCC TTCACTCATT 6660 
CATTTATCCA ACAGAGAACT GGTATTGTAC TAAATGTGGG AGATTTAATT AATTTTTAGA 6720 
AGCTCTGTTG ATTGACTGAT TGTGCATGTA TGTGGACAGG TACATACCAC AGCACACGTG 6780 
TGGCAATCGG AGAAAGGTTT TGGGTGTTGT TTTCTCTTCC CACCGTGTGG GTTCTGGGGA 6840 
TTGAACTCAA ATTATCGGGC TGGTGGCAAG TGTCTTTACC ACCGAGCCAT TTTGCTGACA 6900 
CATCATTATT ATTAGAAAGC ATCTTATGTA GTCCAGGCTG GCCTCAAGCT TGCTATGTCG 6960 
CCACGGATGA CCTTTAACTC CTGCTCTTCC AGCCTCCACC CGAGTGCTAG GTTTACAGGT 7020 
GTTCAACTGG TGAATGCCTT TAATCCCAGC ACTCTGTGGG GGGGGGGGGG GAGGCGGATC 7080 
CCTGAGTTGG AGGCCAGTTT GGTCTACAGA GTTTCAGGAT ACCTGGGGCT ATACAGGGAA 7140 
ACCCTATCCC AAACAAACAA ACAAACAAAC AAAAAATATT CTGTGCAATA ATCACAGAGA 7200 
TTAGAGGATA TTAGTAGGGT AGTAGGGCXG GTGAGGGAGA GTCATGCTTT CTTTTGTATT 7260 
ATAATAGTAA AGTACTCACA AGATGCATTA T CTATCTAT C TATCTATCTA TCTATCTATC 7320 
TATCTATCTA TCTACCTACC TACCTACCTA TCCATCCATC CATCXATCGT ATAGCCCAGG 7380 
CTGCTTTGAC TCTGAATGCT CCTATTTCTG GGTCAACTCT TCACCCCTAG TGTTGGGTTT 7440 
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ACCAACACCC AGACATTTAT TTTATTTTGT TTTATTTTAT TAATCTAGGA GCTCAGGGTG 750O 
GGACTCAGGG TCTTGTGCAT GCTAAGCAAG CTCTCTGCCA CAGAGCTGCA GCTCCAGTCC 7560 
CCA T T TT GTT CAGGTGACTC TGTGACAGTT GTCATATTCG CAGCGCTATG TAGCTCTCTC 7620 
CACCTCCCAG TTCCAGCACT TTCTGGTCAT CCCAGTGGGC GGGCAACTCT GTGCTCACCA 7680 
GTGCCCTGTT CCCTGTCTTC AGACCTACAT ATTTGCCTGT CTGAACAGTT CATGTAAATG 774 0 
GGATGCGTTC CTGTGTATTC TTTTATGGCT GGCCCCTTTA TCTTAGCACA GTTTGTGTTG 7800 
GGCCATGTGT CACTGCTATA CTCTATCTTA TCATCATCTT ATGGCTTAAT AGTGTTCCTT 7860 
TGTGTGGATA AACCACTTTC TGTTTCATTT ACTGATGGAA ATTTGTGGCC CCACCCCCAC 7920 
CCTTTTTTTT TTTATTTGAG ACAAGGTCTT TCTGTGTAAT CTTGCAATCT TGGCTGTCCT 7S80 
GAGCTCACTC TGTAGACCAG GCTGTGAGGC TGTCCTTCCA CTTTTGACAC TCCTGTGAAG 8040 
AGAGTAGCCA TGAACTTCAA AGACAATTTT CTGTTTTGGT TTGTTTTTTA CATTTGTGTG 8100 
TGTATGCGTG TATATGTGCA TGTTTGTGTC TTCAGGTGCT CACATGTGTG TACCTGTGTG 8160 
TGGGAGAGAG AACAAACCGA TGTGCCATTC CTCAGATACT ACGCATCTTG TTAATATGTA B220 
TGTATTATGT ATGTTTATTT AGTGTGCCCA AGTATGCAGG TATTTTGTTG GAGTTTTCAC 8280 
CTTCC C TT GT GGGCTCTCCG CATTAAACTC AGCTCCTCGG GCTAGTGAGC AATGCCTTCA 8340 
CTCGATGAGC CATCTCGCTG CCCCTGCTGC CACCTCCTCC TTATTTCCCA GATGGGACTA 8400 
CGCACTGCAC TGGCCTAAAG CTCACCAAGT CATCCAGAGT GGCTAGCCAG GGAGACTCAG 8460 
GGATATGCTG GCCTCTGCCT CCACAGTGCT AGAATTACAG GCATACATCA CTGCTGGAAG 8S20 
ATTTTTAACC TGAATCCTGA GGATAGAGCA GGCACTCTAC GAATGGAGGG TTCTTTTTGT 8580 
GTTTGGTTTG GTTTCCTCTG CATAAGATCA GGCAGTCTGA AATAGTGTAG CCTGGGCTAC 8640 
ATAACATCTT GTCTCAAAAA GCCTATAGAG GTAGGGAGGT CGAGGCTAAA GAAGAGCCTT 8700 
AAGC CGGCTG TGATAGCACA CAGGATAGCC TGCACTATAT AGCAAGACCT TGTTTCAAAA B760 
ACATGGAGGG AGGGGTATGT TTTAAGTGCT GGGCTGTGTA ACAGGCACTA AGGGAGCCAA 8820 
TGTAGACATT TGACTAAGAA AGGATCATCA TCAAAGCCGG GTGGGCAGGG TAGAGGTTGG 8880 
ACTACAGTGG TCAAGACCCC CATAGGAAGC CAGTTTCCCT TCTTCCTCTG GGCCTCAAGC 8940 
CTGGCTCGAC GGCCACTGCT CTCACATGCC TTCTC CTCTA GGCTCGTCC^ g CATfl 899S 

exon 2 



SUBSTITUTE SHEET (RULE 26) 



WO 00/34492 



PCT/EP98/OS009 



12 / 15 




SUBSTITUTE SHEET (RULE 26) 



WO 00/34492 




PCT/EP98/08009 



13 / 15 



Targeting of oncogenes 
and tumor suppressor genes 



1 vitltn 



promoter 



i 



transgenic mice 




tumors of digestive epitheliums 
expressing vflffn 



3 



^ c oo per a tivity between oncogenes^ 



development of animal models of 
colorectal carcinogenesis 



FIGURE 8A 



SUBSTITUTE SHEET (RULE 26) 



WO 00/34492 




PCT/EP98/08009 



14 / 15 



Targeting of immortalizing gene 



'■U 



W 

in 




c 



to vBBn positive tissues) 



primary Bxpiants 
at the permissive 



•stabfishfnent of cell fines 
t the digestive tract 



FIGURE 8B 



SUBSTITUTE SHEET (RULE 26) 



WO 00/34492 




PCT7EP98/08009 



15 / 15 



!□ 

I 

liJ 

in 

! 

H 



I* 



Targeting of transactlvator gene 
(repressor form rtTA) 



L ViHin 



rtTA 



] 




bMransgenic mouse 

rtTA 4A; Pmin-X *A 




+ tetracyclm / ctoxycycJm 



(frw*UCttete expmaKrion of heterologous 
in the intestinal mucosa 



) 



spatio-temporal control of a toxic gens 
Or a lethal gene during development 



FIGURE 8C 



SUBSTm/TE SHEET (RULE 26) 



